INHIBITION OF NON-CD4 MEDIATED HIV INFECTION



TECHNICAL FIELD OF THE INVENTION

The present invention is directed to a non-CD4 cell surface receptor for gp120.
This gp120 receptor (gp120r) has been isolated and cloned and is utilized in the present
invention in methods and kits for the inhibition and detection of HIV infection.

BACKGROUND OF THE INVENTION

Two types of human retroviruses have been identified, leukemia viruses and AIDS-related
viruses. The primary targets of the human retroviruses are T lymphocytes and cells
of the central nervous system. All human retroviruses are transmitted by intimate contact,
blood contamination, and infection in utero or after birth by milk. It is likely that all
human retroviruses originated in Africa and that they encountered the human species via
interspecies infection, possibly from African green monkeys or a related species. The
human retroviruses first discovered, Human T Lymphotropic Virus Type I (HTLV-I) and
Human T Lymphotropic Virus Type II (HTLV-II), have a preferential tropism for T4 cells
and some T8 cells, share significant sequence homology, and are mainly associated with T
cell leukemias and lymphomas. The other group of human retroviruses, generally called
Human Immunodeficiency Viruses (HIV), is discussed in greater detail below. There are
two major differences between the two types of human retroviruses: (1) there is substantial
genomic variability among various HIV isolates, whereas the genomes of HTLV-I and
HTLV-II are stable; and (2) HIV entered human populations much more recently than
HTLV-I or HTLV-II.

The human immunodeficiency virus (HIV) is a cytopathic retrovirus and the
causative agent of the acquired immunodeficiency syndrome (AIDS). Two forms of HIV
have now been identified. The prototype virus, HIV-1, previously termed
lymphadenopathy-associated virus (LAV) and Human T Lymphotropic Virus Type III
(HTLV-III), is responsible for the vast majority of reported AIDS cases worldwide.
Another retrovirus, HIV-2, has been isolated primarily from West African patients with
AIDS and is pathogenically related to HIV-1. On the genetic level, HIV-2 is actually more
closely related to the simian immunodeficiency virus (SIV), a retrovirus infecting
monkeys.

Over half of the people who have contracted AIDS in the United States have already
died. As many as three million persons in this country may be asymptomatic carriers of
HIV and are capable of transmitting the virus. It had been estimated in 1986 that 270,000
cases of AIDS would have occurred in the United States by 1991 (U.S. Public Health
Service, (1986), Public Health Rep. 101:341). The mortality rate from AIDS is
disturbingly high, exceeding 80% within three years of diagnosis and possibly reaching
100% over a longer period.

Worldwide, the AIDS epidemic may involve some five to ten million presently
infected persons. Particularly troublesome are statistics from the African continent where
millions of individuals are believed infected with HIV, deaths range in the hundreds of
thousands, and heterosexual transmission predominates. To date, there is neither a known
cure for AIDS nor an effective vaccine against HIV infection.

HIV is a member of the nontransforming, cytopathic lentivirus family of
retroviruses. HIV causes a typically fatal disease characterized by severe
immunodeficiency or neurodegenerative disease, or both. The primary basis for
HIV-induced immunosuppression is the depletion of the helper/inducer subset of T lymphocytes
expressing the CD4 molecule (T4 or CD4+ cells), which serves as a high affinity cell
surface receptor for the virus. T4 lymphocytes are involved directly or indirectly in the
induction of nearly every immunologic function in the body, and their depletion results in
susceptibility to a wide range of opportunistic infections and neoplasms.

In addition to the T4 lymphocyte, other cells expressing the CD4 molecule are
targets of HIV infection, especially monocyte-macrophages. HIV infection also results in
serious B cell abnormalities including polyclonal activation, hypergammaglobulinemia,
elevated levels of circulating immune complexes, and autoantibodies. A decreased number
of functional natural killer (NK) cells has also been observed in AIDS patients.

Infection of CD4+ cells is initiated by the interaction of the CD4 molecule with the
major HIV envelope glycoprotein gp120, an event which is followed by internalization and
uncoating of the virion, transcription of genomic RNA to DNA by virus-encoded reverse
transcriptase, and integration of the resulting proviral DNA into host cell chromosomal
DNA. Also, unintegrated proviral DNA accumulates in large amounts within infected cells
and is probably a significant factor in HIV cytopathology (Shaw et al., (1984) Science
226:1165).

The depletion of CD4+ T cells appears to contribute significantly to the
immunosuppression associated with AIDS. A primary cytopathic effect of the virus in
vitro is HIV-induced syncytium formation. CD4, through its interaction with gp120, plays
an important role in syncytium formation. However, it has been observed that molecules
on the cell surface of uninfected cells other than CD4 are also involved in HIV-induced
cell fusion (Hildreth et al. (1989) Science 244:1075-1078).

Infection by HIV produces, in addition to AIDS, a set of neuropsychiatric disorders
which are called the AIDS dementia complex (ADC) (Price et al., (1988) 222:586-592).

The symptoms of ADC include cognitive impairment, apathy and motor
dysfunctions, and may affect as many as 90% of AIDS victims. The underlying cause of
ADC appears to be the death of brain cells, and HIV-1 can be isolated from the brains of
infected individuals (Ho et al., (1987) N. Eng. J. Med. 212:278-286).

An early study suggested that the cellular attachment site for HIV in brain might be
CD4 (Pert et al., (1986) Proc. Natl. Acad. Sci. USA), but attempts to
replicate these findings were not successful (Kozlowski et al., (1989) Neurosci. Abstr.
15:671). It now appears unlikely that the CD4 antigen is involved in the infection of
brain-derived cells by HIV. Susceptibility of brain cells to infection with HIV-1 does not
correlate with the level of expression of CD4 (Chang-Mayer et al., (1987) Proc. Natl.
Acad. Sci. USA 84:3526-3530; Srinivasan et al., (1988) Arch. Virol. 28:135-141), and
infection of brain-derived cells by HIV-1 is not blocked by anti-CD4 antibodies (Clapham
et al., (1989) Nature 337:368-370; Li et al., (1990) J. Virol. 64:1383-1387).

The present invention demonstrates the presence of a non-CD4 receptor for gp120
and a method for the inhibition of HIV infection of cells such as brain and muscle which
do not express high levels of CD4.

SUMMARY OF THE INVENTION

Many cells that are susceptible to HIV infection appear to bind gp120 through a
non-CD4 surface protein. The present invention has identified this non-CD4 gp120
receptor (gp120r) and has recombinantly expressed and characterized gp120r.

In this invention a specific non-CD4 gp120r has been isolated which has specific
binding activity for gp120 present on Human Immunodeficiency Virus-1 (HIV). This
gp120r has a molecular weight of about 45,000 daltons, contains about 400 amino acid
residues and is characterized by a Kd for gp120 of about 1.3 nM to about 2.0 nM. The
binding of gp120 to gp120r is inhibited by specific carbohydrates, such as mannose and
fucose, plant lectins such as concanavalin A, and specific antibiotics, such as pradimicin A.

In one embodiment of the present invention, a cDNA molecule that transcribes an
mRNA encoding gp120r is cloned and expressed to produce gp120r. The DNA is
selected from a gene library obtained from tissue such as placenta, brain, muscle and
colon.

A method of inhibiting HIV infection of mammalian cells, such as brain, muscle
and neural cells, is contemplated by the present invention. In this method, cells are
contacted with an effective amount of an appropriate inhibitor of gp120r binding for a time
period sufficient to significantly inhibit the binding of HIV to
gp120r. Specific inhibitors of gp120r binding include mannose carbohydrates, fucose
carbohydrates, plant lectins, and antibiotics such as pradimicin A.

The gp120r of the present invention can also be utilized in a method and a kit
for detecting the presence of HIV in a fluid sample. In this method, the binding of HIV
to gp120r ...
to a solid matrix to form a solid support that is useful in this method and/or kit.



DESCRIPTION OF THE FIGURES

In the drawings:

FIGURE 1 illustrates expression cloning of the gp120r cDNA and comparison to CD4.

A: Autoradiography of gp120 binding to gp120r and ... 200 nM unlabelled bgp120;
D, CD4; E, CD4 with G17-2; F, CD4 with bgp120. G-L, [125I]ngp120: G, gp120r;
H, gp120r with 110.1; I, gp120r with bgp120; J, CD4; K, CD4 with 110.1;
L, CD4 with ...
B: ... CD4. A+G, HIV antisera (1:20; Trimar); B+H, D-galactose (100 mM);
C+I, D-mannose (100 mM); D+J, L-fucose (100 mM); E+K,
concanavalin A (1 mg/ml); F+L, pradimicin A (100 µg/ml).
C: gp120r binding of HIV. A, HIV; B, HIV with 200 nM bgp120.

FIGURE 2 illustrates the characterization of the gp120r.

A: Scatchard analysis of [125I]gp120 binding. ▲ - ▲, vgp120 binding to
placenta, Kd 1.3 nM, 19 fmol/mg protein; ■, with ... µg/ml G17-2; • - •,
vgp120 binding to gp120r COS cells, Kd 1.7 nM, 150,000 receptors/cell.

B: Inhibition of [125I]gp120 binding to gp120r COS cells. Open symbols,
ngp120; filled symbols, vgp120. The relative values were the same with
both forms of gp120. ... µg/ml); •, L-fucose (Ki 6 mM);
▲, α-methyl D-mannoside (Ki 15 mM); O, D-mannose (Ki 23 mM);
O, N-acetylglucosamine (Ki 70 mM); ■, EGTA (Ki 0.3 mM).

C: Internalization of gp120 by gp120r COS cells. Points represent the mean of
two experiments with vgp120 and ngp120. • - •, surface; O - O, internal.

D: 1, placenta control sera; 2, placenta HIV sera; 3, gp120r COS control sera; 4,
gp120r COS HIV sera.

E: Northern blot of gp120r expression. Polyadenylated (A+); 2, placenta; 3,
thymus; 4+12, forebrain; 5, skeletal muscle; 6, heart; 7, liver; 8, kidney;
9, colon; 10, medulla; 11, cerebellum; 13, T cell (CEM; 16 µg A+); 14, B
cell (IS-l; 16 µg A+); 15, macrophage (U937; 8 µg A+); 16, cervical
carcinoma (HeLa; 16 µg A+). The different apparent size of the ~5 kb
band is an artifact of displacement by 28S rRNA.

FIGURE 3 illustrates the sequence analysis of the gp120r.

A: Nucleotide and deduced protein sequence of gp120r cDNA.

B: Hydropathicity plot of the gp120r. The predicted transmembrane segment
and the start of the eight amphipathic repeats are indicated by arrows.

C: Amino acid alignment of the gp120r C-type lectin domain.

HIV infection of brain and muscle cell lines is not blocked by soluble CD4 or
anti-CD4 antibodies (Clapham, P.R. et al., (1989) Nature 337:368-370; Harouse, J.M. et al.,
(1989) J. Virol. 61:2527-2533; Weber, J. et al., (1989) J. Gen. Virol. 22:2653-2660).
This is consistent with the existence of a second gp120 receptor. Binding studies indicated
that human placenta was another source for a non-CD4 gp120 receptor, and a cDNA for a
second gp120 receptor (gp120r) was isolated by the present invention from a placental
library. The gp120r has a higher binding affinity for gp120 than CD4. Sequence analysis
revealed homology to membrane associated C-type lectins, and inhibition studies have
shown that the receptor binds gp120 through a mannose or fucose containing carbohydrate.
The gp120r rapidly internalizes gp120, and is expressed in placenta, thymus, muscle, and
colon. These results, when considered with previous studies on the role of gp120
carbohydrate in HIV infection (Lifson, J. et al., (1986) J. Exp. Med. 164:2101-2106;
Ezekowitz, R.A.B. et al., (1989) J. Exp. Med. 169:185-196; Larkin, M. et al., (1989)
AIDS 3:793-798; Tanabe-Tochikura, A. et al., (1990) Virology 126:473-476), suggest a
potential role for the gp120r in HIV infection or pathology.

The present invention demonstrates that the gp120r participates in cellular binding
of HIV by a non-CD4 pathway in muscle and brain, as well as facilitating virus
attachment in CD4 positive cell types. It is likely that the gp120r plays a significant role in
transplacental transport of HIV (Zacher, V. et al., (1991) J. Virol. 65:2102-2107) and
colon infection (Barnett, S.W. et al. (1991) Virol. 182:802-809). Gp120 produces an
increase in intracellular calcium in rat retinal ganglion cells (Dreyer, E.B. et al., (1990)
Science 248:364-367), suggesting that the gp120r or a homologous protein may have
signaling functions in the nervous system that are disrupted by gp120, leading to HIV
neurotoxicity.

In the present invention, a new non-CD4 binding protein, or receptor, for gp120
was isolated. The HIV surface protein gp120 was found to bind to a receptor on human
placental membranes, and this binding was not blocked by antibodies directed against CD4,
such as G17-2 and OKT4a, which interfere with gp120 binding to CD4. A cDNA encoding
this receptor was isolated from a placental cDNA library in a mammalian expression vector
(pCDM8). The gene products were expressed in COS cells and were screened by
125I-labelled gp120 binding. From a pool of 90,000 cDNA molecules, a single clone was
isolated that encoded a protein which bound gp120, even in the presence of concentrations
of anti-CD4 antibody (G17-2) which completely blocked gp120 binding to CD4.

Sequence studies were carried out and indicated that the 1.5 kilobase cDNA clone
encoded a previously unknown member of a family of Type II membrane proteins with an
extracellular C-type lectin domain.

The cloned gp120r of the present invention binds gp120 with a dissociation constant (Kd)
of about 1 to 2 nM, which corresponds to a considerably greater affinity than that of CD4
for gp120 (Kd of about 4 nM).
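
To make the affinity comparison concrete, the following is a minimal sketch, assuming a simple single-site equilibrium binding model (an assumption made here for illustration, not a model stated in the specification). The function name, the 1.5 nM midpoint and the 1 nM ligand concentration are likewise illustrative choices; only the Kd ranges come from the text above.

```python
def fractional_occupancy(ligand_nM: float, kd_nM: float) -> float:
    """Equilibrium fraction of receptor sites occupied (single-site model)."""
    if ligand_nM < 0 or kd_nM <= 0:
        raise ValueError("ligand must be >= 0 and Kd must be > 0")
    return ligand_nM / (ligand_nM + kd_nM)


# Kd values quoted in the text; the midpoint for gp120r is an assumption.
KD_GP120R_NM = 1.5  # midpoint of the "about 1 to 2 nM" range
KD_CD4_NM = 4.0     # "about 4 nM"

if __name__ == "__main__":
    ligand_nM = 1.0  # illustrative gp120 concentration, not from the source
    for name, kd in (("gp120r", KD_GP120R_NM), ("CD4", KD_CD4_NM)):
        occupied = fractional_occupancy(ligand_nM, kd)
        print(f"{name}: {occupied:.2f} of sites occupied at {ligand_nM} nM gp120")
```

Under this model a lower Kd gives higher occupancy at any given ligand concentration, which is the sense in which the gp120r affinity exceeds that of CD4.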

The binding of gp120 to gp120r is not blocked by polyclonal HIV antisera, but is
inhibited by mannose carbohydrates, fucose carbohydrates, plant lectins such as
concanavalin A, and pradimicin A antibiotics. Other sugars such as N-acetyl-D-glucosamine
and galactose are less potent inhibitors.

The gp120r is expressed on many mammalian cells which do not exhibit high levels
of CD4, such as placenta, skeletal muscle, brain, and mucosal cells. Other tissue and cells
displaying gp120r include colon, thymus, heart, T cells, B cells and macrophages. The
distribution of tissue having gp120r parallels that for binding of gp120 which is not
blocked by CD4 antibodies, and for HIV infection which is not neutralized by soluble
CD4. This observation suggests a role for gp120r in viral infection.

In gp120r expressing transfected COS cells, gp120 is rapidly internalized following
binding to gp120r. This binding and internalization of gp120 is inhibited by compounds
such as mannan, concanavalin A and pradimicin A.

In the present invention a cDNA which encodes gp120r was isolated and cloned. A
DNA molecule of the present invention corresponds to a complementary DNA molecule
which is transcribed into a messenger RNA (mRNA) molecule which, when translated, yields
gp120r. The cDNA molecules were obtained by reverse-transcribing mRNA molecules
isolated from mammalian tissue such as placenta, colon, brain or thymus. The
transcription and cloning of cDNA molecules and isolation of gene products are techniques
well known in the art and, for example, are described in Sambrook et al., "Molecular
Cloning: A Laboratory Manual", 2d edition, Cold Spring Harbor Lab., Cold Spring
Harbor, NY (1989), which is incorporated herein by reference.

As used herein, the phrases "physiologically tolerable" and "pharmaceutically
acceptable" refer to molecular entities and compositions that do not produce an allergic or
similar untoward reaction, such as gastric upset, dizziness and the like, when administered
to a mammal. The physiologically tolerable carrier may take a wide variety of forms
depending upon the preparation desired for administration and the intended route of
administration.

A carrier is a material useful for administering the active compound and must be
"acceptable" in the sense of being compatible with the other ingredients of the composition
and not deleterious to the recipient thereof.

The pharmaceutical compositions are prepared by any of the methods well known in
the art of pharmacy, all of which involve bringing into association the active compound and
the carrier therefor.

For therapeutic use, the agent utilized in the present invention can be administered
in the form of conventional pharmaceutical compositions. Such compositions can be
formulated so as to be suitable for oral or parenteral administration, or as suppositories. In
these compositions, the agent is typically dissolved or dispersed in a physiologically
tolerable carrier.

As an example, the compounds of the present invention can be utilized in liquid
compositions such as sterile suspensions or solutions, or as isotonic preparations containing
suitable preservatives. Particularly well suited for the present purposes are injectable
media constituted by aqueous injectable isotonic and sterile saline or glucose solutions.
Additional liquid forms in which the present compounds may be incorporated for
administration include flavored emulsions with edible oils such as cottonseed oil, sesame
oil, coconut oil, peanut oil, and the like, as well as elixirs and similar pharmaceutical
vehicles.

The present agents can also be administered in the form of liposomes. As is known
in the art, liposomes are generally derived from phospholipids or other lipid substances.
Liposomes are formed by mono- or multi-lamellar hydrated liquid crystals that are
dispersed in an aqueous medium. Any non-toxic, physiologically acceptable and
metabolizable lipid capable of forming liposomes can be used. The present compositions
in liposome form can contain, in addition to the agent of the present invention, stabilizers,
preservatives, excipients, and the like. The preferred lipids are the phospholipids and the
phosphatidyl cholines (lecithins), both natural and synthetic.

Methods to form liposomes are known in the art. See, for example,
Methods in Cell Biology, Volume XIV, Academic Press, New York, N.Y. (1976), p. 33.

The present compounds can also be used in compositions such as tablets or pills,
preferably containing a unit dose of the compound. To this end, the agent (active
ingredient) is mixed with conventional tabletting ingredients such as corn starch, ...
dicalcium phosphate ...

The compositions can be laminated or otherwise compounded to provide unit dosage
forms affording prolonged or delayed action.

The pharmaceutical formulation described herein can include, as appropriate, one or more
additional carrier ingredients such as diluents, buffers, flavoring agents, binders, surface
active agents, thickeners, lubricants, preservatives (including antioxidants) and the like,
and substances included for the purpose of rendering the formulation isotonic with the
blood of the intended recipient.

The tablets or pills can also be provided with an enteric layer in the form of an
envelope that serves to resist disintegration in the stomach and permits the active
ingredient to pass intact into the duodenum or to be delayed in release. A variety of
materials can be used for such enteric layers or coatings, including polymeric acids or
mixtures of such acids with such materials as shellac, shellac and cetyl alcohol, cellulose
acetate, and the like. A particularly suitable enteric coating comprises a styrene-maleic
acid copolymer together with known materials that contribute to the enteric properties of
the coating.

A method of inhibiting HIV infection of mammalian cells is contemplated by the present
invention. A pharmaceutical composition containing a compound that inhibits
the binding of gp120r to HIV is contacted with cells either in vitro or in vivo for a time
period sufficient to significantly inhibit the binding of HIV to the cell surface.

Compounds effective in this method include mannose carbohydrates, fucose
carbohydrates, plant lectins and pradimicin A antibiotics. Especially preferred
compounds are mannose, fucose, mannan, concanavalin A and pradimicin A. The
pharmaceutical composition of the present invention contains a compound that
inhibits gp120r binding to HIV and may also include a physiologically tolerable carrier.

The method of the present invention is preferably utilized to inhibit HIV infection
of placental, brain, muscle, neural and colon cells.

A diagnostic method is also described in the present invention for detecting the
presence, and preferably the amount, of HIV present in a fluid sample by producing a
reaction product containing HIV bound to gp120r. Those skilled in the art will recognize
that there are well known clinical diagnostic procedures that can be utilized for the
formation and detection of such reaction products. Thus, while exemplary assay methods
are described herein, the invention is not intended to be so limited.

Various heterogeneous and homogeneous assay protocols can be employed for
detecting the presence, and preferably the amount, of HIV in a fluid sample. For example,
the present invention contemplates a method for assaying a sample, such as a body fluid,
for the presence of HIV comprising the steps of:

(a) admixing a fluid sample with gp120r, either in solution or affixed to a solid
matrix;

(b) maintaining the admixture for a predetermined time period such as about 10
minutes to about 16 to 20 hours and under biological assay conditions at a
temperature of about 4°C to about 45°C that is sufficient for any HIV
present in the sample to react with (bind) the gp120r to form a reaction
product; and

(c) determining the presence of any reaction product that is formed, and thereby
the presence of any HIV in the admixture.
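
As an illustration only, the incubation window recited in step (b) can be encoded as a simple range check. This is a minimal sketch: the class and function names are hypothetical, and the hard limits are a simplification, because the text qualifies every bound with "about".

```python
from dataclasses import dataclass

# Bounds taken from step (b); the text says "about", so these are nominal values.
MIN_MINUTES = 10.0
MAX_MINUTES = 20 * 60.0  # upper end of the "about 16 to 20 hours" range
MIN_TEMP_C = 4.0
MAX_TEMP_C = 45.0


@dataclass(frozen=True)
class Incubation:
    """One proposed maintenance step for the sample/gp120r admixture."""
    minutes: float
    temperature_c: float


def within_stated_ranges(step: Incubation) -> bool:
    """Return True if both time and temperature fall inside the nominal ranges."""
    time_ok = MIN_MINUTES <= step.minutes <= MAX_MINUTES
    temp_ok = MIN_TEMP_C <= step.temperature_c <= MAX_TEMP_C
    return time_ok and temp_ok


if __name__ == "__main__":
    for step in (Incubation(60, 22), Incubation(5, 22), Incubation(60, 50)):
        print(step, "->", within_stated_ranges(step))
```

A real protocol would treat these limits as approximate and would choose time and temperature together, since the text requires only that the combination be sufficient for any HIV present to bind the gp120r.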

Preferably, the fluid sample is a body fluid sample, such as blood, plasma, serum,
urine, saliva, semen or cerebrospinal fluid (CSF).

The determination of the presence of a reaction product, either directly or
indirectly, can be accomplished by assay techniques well known in the art such as by the
use of an indicating or labelling means, as discussed hereinbelow. In a preferred
embodiment, a labelled indicating means, such as a fluorescein-labelled antibody, is
capable of binding to the gp120r present in the reaction product to form a labelled
complex. Determining the presence of the labelled complex provides an assay for the
presence of HIV in the sample. In particularly preferred embodiments, the amount of
labelled indicating means bound as part of the complex is determined, and thereby the
amount of HIV present in the sample is determined. When that amount is zero, no HIV is
present in the sample, within the limits of detection. Methods for assaying the presence
and amount of a labelled indicating means depend on the label used, such labels and assay
methods being well known in the art.

In a preferred embodiment, the gp120r is affixed on a solid matrix to form a solid
phase support. In that embodiment, the assay is a heterogeneous, solid/liquid phase assay
and, as such, has its own preferred manipulations. For example, following admixing of a
liquid sample with a solid support containing gp120r affixed thereto, the admixture is
... This removes any material present that did not react with the solid support ...

A labelled indicating means is then admixed with the separated solid phase
to form a solid/liquid phase labelling-reaction admixture which is
... present is determined.

As used herein, the term "biological assay conditions" refers to parameters that
... include a temperature range of about 4°C to about 45°C ... about one
... For example, a temperature of about 20 ...

A kit ... is also contemplated by the present invention. Such a kit includes
... in an amount sufficient ...

As used herein, the term "instructions for use" typically includes a tangible ...

As used herein, the term "package" refers to a solid material such as glass, plastic,
paper, or the like capable of holding within fixed limits the gp120r, and preferably
also a detection means. In one embodiment, the package can contain a microtiter plate
well to which microgram quantities of gp120r have been operatively affixed, i.e., linked so
as to be capable of reacting with and binding HIV and/or gp120.

As used herein, the terms "label", "indicating means" and "labelled indicating
means", in their various grammatical forms, refer to single atoms and molecules that are
either directly or indirectly involved in the production of a detectable signal to indicate or
detect the presence of a reaction product. Such labels are themselves well known in
clinical diagnostic chemistry and constitute a part of this invention only insofar as they are
utilized with otherwise novel methods and/or systems.

The indicating means can be a fluorescent labelling agent that chemically binds to
antibodies or protein antigens without denaturing them to form a fluorochrome (dye) that is
a useful immunofluorescent tracer. Suitable fluorescent labelling agents are fluorochromes,
such as fluorescein isocyanate (FIC), fluorescein isothiocyanate (FITC),
5-dimethylamine-1-naphthalene sulfonyl chloride (DANSC), tetramethylrhodamine
isothiocyanate (TRITC), lissamine and the like. Immunofluorescence analysis techniques are
well known in the art and, for example, are described in DeLuca, "Immunofluorescence
Analysis", in Antibody As a Tool, Marchalonis et al., eds. (1982), John Wiley & Sons, Ltd.,
pp. 189-231, which is incorporated herein by reference.

Other preferred indicating means are colorimetric agents and enzymes, such as
horseradish peroxidase, glucose oxidase or the like, linked as described above, as well as
radioactive elements, preferably an element that produces gamma ray emissions. Elements
which emit gamma rays, such as 124I, 125I, 128I, 132I, and 51Cr, represent one class of
radioactive indicating groups. Another group of useful labelling means are those elements
such as 11C, 18F, 15O and 13N which emit positrons. The positrons so emitted produce
gamma rays upon interaction with electrons present.

Having generally described this invention, a further understanding can be obtained 
by reference to certain specific examples which are provided herein for purposes of 
illustration only and are not intended to be limiting unless otherwise specified. 

EXAMPLE 1

Cloning and Isolation of Non-CD4 gp120 Receptor Protein

Human placental membranes were found to be able to bind vaccinia derived
recombinant gp120 (vgp120) with a Kd of 1.3 nM. At nM concentrations of gp120 none
of this binding was inhibited by an antibody (G17-2) which has been reported to efficiently
block gp120 binding to CD4 (Linsley et al. (1988) J. Virol. 62:3695-3702), as shown in
FIGURE 2A. A placental cDNA library was obtained in the vector
pCDM8 and was screened. A cDNA was isolated which expressed a protein that exhibited
high affinity binding for vgp120 in the presence of G17-2.

This protein, designated as gp120 receptor (gp120r), also bound native gp120
... directed against gp120.



EXAMPLE 2

The binding of radiolabelled gp120 to gp120r expressed in COS-7 cells was
studied. Pools of 90,000 cDNA molecules, obtained from a ...
surface was identified by screening with either 1 nM of 125I-labelled
... by the method described in Kozlowski et al., (1990) ...
studies utilizing the transfected COS-7 cells are shown in FIGURE 1.

... preincubation of the cells or gp120 at 22°C with one or more of the following: anti-CD4
antibody (5 µg/ml), baculovirus-derived gp120 (bgp120, American ...,
200 nM), anti-gp120 monoclonal antibody 110.1 (25 ...), ...
(100 mM), L-fucose (100 mM), concanavalin A (1 mg/ml) or pradimicin A (100
µg/ml). ... autoradiography (... days).

FIGURE 1 (A and B) illustrates that gp120 binding to the gp120r expressed on the cells
was blocked by excess bgp120, mannose, fucose, pradimicin A, concanavalin A, and
preincubation with antibody 110.1 but not by CD4, antibody G17-2,
... antisera. Studies were also carried out on gp120 binding to ... cells,
transfected with CD4 ...

Control studies of the binding of 125I-labelled psoralen-UV inactivated HIV-BRU
to the gp120r expressing COS-7 cells demonstrated binding of HIV to gp120r and blockage
... material to the cells in FIGURE 1 is shown in Table 1.

TABLE 1

LABELLED      PREINCUBATION WITH         CPM BOUND X 10^-3
MATERIAL                                 GP120R      CD4

vgp120 +      -                          60          20
              G17-2                      60           5
              bgp120                      6           4

vgp120 +      HIV antisera               63           3
              D-galactose                50          20
              D-mannose                   6          20
              L-fucose                    6          20
              concanavalin A              8           6
              pradimicin A                8           6

              OKT4A                      60           5
              N-acetylgalactosamine      60          20
              N-acetylglucosamine        30          20
              mannose-6-phosphate         6          20
              sialic acid                60          20
              human IgE                  60          20
              ...                        60          20

HIV-BRU +     -                           8           4
              bgp120                      2           2

Scatchard plots of gp120 binding to placental membranes and to COS cells
expressing the gp120r were carried out in the presence and absence of a 200 fold excess of
bgp120 or ngp120. The results, shown in FIGURE 2A, demonstrate a specific binding of
vgp120 to gp120r with a Kd of 1.7 nM ± 0.4 (n=4) and of ngp120 to gp120r with Kd of
1.8 nM ± 0.2 (n=4), with 150,000 and 149,000 receptors per cell, respectively.
Concurrent analysis of gp120 binding to CD4 expressed on COS cells gave a Kd of 4-5 nM
in agreement with previous reports (Linsley, P.S. et al. (1988) J. Virol. 62:3695-3702;
Schnittman, et al. (1988) J. Immunol. 141:4181-4186). Calculations from the association
and dissociation rate constants gave a similar comparative result. The expressed gp120r
has a relative molecular mass (Mr) of ~48,500 and a protein of similar size was also
partially purified from placental membranes (FIGURE 2D).
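
The following minimal sketch shows how a Kd and a receptor count are conventionally read off a Scatchard plot. It is not the authors' analysis: the function names, the synthetic data points and the cell count are hypothetical, chosen only so that the fit recovers values of the same order as those reported above. It assumes a single class of non-interacting sites, for which bound/free plotted against bound is a straight line with slope -1/Kd and x-intercept Bmax.

```python
AVOGADRO = 6.02214076e23  # molecules per mole


def scatchard_fit(bound_fmol, free_nM):
    """Least-squares line through (bound, bound/free).

    Returns (Kd in nM, Bmax in fmol) from slope = -1/Kd and x-intercept = Bmax.
    """
    xs = list(bound_fmol)
    ys = [b / f for b, f in zip(bound_fmol, free_nM)]
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return -1.0 / slope, -intercept / slope


def receptors_per_cell(bmax_fmol, n_cells):
    """Convert a Bmax in femtomoles into binding sites per cell."""
    return bmax_fmol * 1e-15 * AVOGADRO / n_cells


if __name__ == "__main__":
    # Synthetic, noise-free data generated from an assumed Kd and Bmax.
    true_kd_nM, true_bmax_fmol = 1.7, 250.0
    free = [0.25, 0.5, 1.0, 2.0, 4.0, 8.0]
    bound = [true_bmax_fmol * f / (true_kd_nM + f) for f in free]
    kd, bmax = scatchard_fit(bound, free)
    print(f"Kd = {kd:.2f} nM, Bmax = {bmax:.1f} fmol")
    print(f"sites per cell (1e6 cells assumed): {receptors_per_cell(bmax, 1e6):.0f}")
```

With real binding data the points scatter about the line, and non-specific binding (here, binding remaining in the presence of the 200 fold excess of unlabelled gp120) would be subtracted before fitting.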

The placental membranes and COS cells were surface iodinated, and treated with 1
nM unlabelled vgp120, then washed with Blotto (RPMI, 5% BSA, 1% non-fat dry milk,
0.2% sodium azide), solubilized in Triton X-100 (1% in PBS with a protease inhibitor
cocktail: PMSF, pepstatin A, orthophenanthroline and leupeptin) and immunoprecipitated
with HIV or control human sera, according to the method described in Curtis et al. (1990)
...

... ~5 kb, and a minor band of ... which may represent an alternatively processed
... and is more consistent with the size of the gp120r cDNA.

... in an agarose gel, transferred to ...
gp120r RNA was highest in colon followed by thymus, placenta,
... heart, and was not detected in liver or kidney. Levels of expression
in T cell, B cell and macrophage (FIGURE 2E) require verification by polymerase
chain reaction (PCR). Full length CD4 RNA was highest in thymus, T cell, and
macrophage followed by placenta and colon (not shown).

The gp120r cDNA encodes a protein of 404 amino acids with a calculated Mr of
... Sequencing of both strands of gp120r cDNA was carried out by the
... method. ... consensus. The predicted ... has ...
sequence homology to other type II membrane protein C-type lectins
... predicted in part by homology to related sequences in FIGURE 3C. The potential
N-linked glycosylation site is marked by an asterisk. The start of the seven complete and
eighth partial tandem repeats are indicated (R1-R8). The consensus repeat sequence is
IYQELTQLKAAVGELPEKSKLQE. The beginning of the lectin domain is also
marked. No signal sequence was apparent, but instead ... demonstrated homology to a
... sequence for membrane translocation. The "positive inside rule" (von Heijne, G. et al.
(1988) Eur. J. Biochem. 174:671-678) for the sequence within fifteen residues of the
transmembrane region predicts a cytoplasmic amino terminus in agreement with the
homology to membrane associated C-type lectins with similar membrane orientation
(FIGURE 3C; Spiess, M. (1990) Biochemistry 22:10009-10018). This region, Met 1 to
Ala 76, represents the first domain of the gp120r sequence.

The second domain (Ile 77 to Val 249) consists of tandem repeats of nearly
identical sequence (FIGURE 3A). This region was predicted to consist of a series of
amphipathic α-helices interrupted by turns. Circular dichroism spectra in 40%
trifluoroethanol of a consensus repeat peptide beginning with the turn,
PEKSKLQEIYQELTQLKAAVGEL (single-letter amino-acid code), demonstrated an all
α-helical structure (not shown). Homology to other repeat domains suggested three possible
tertiary structures: (1) antiparallel helix bundles, (2) a multimeric parallel helix bundle, and
(3) a membrane pore with a hydrophobic exterior and a negatively charged interior. The
first two models would function as spacers to separate the lectin domain from the
membrane, while the third could generate a transmembrane signal after ligand binding.
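
The tandem-repeat organization described above can be checked against the sequence itself. The following is a minimal Python sketch, assuming that residues 101 to 249 are as printed in the translation under SEQ ID NO: 1. It scans that stretch for an eight-residue motif taken from the consensus repeat peptide and reports where each copy begins. The choice of motif, the constant names and the helper function are illustrative assumptions, not part of the described method.

```python
# Residues 101-249 of the gp120r sequence in single-letter code,
# transcribed 16 residues per line from the SEQ ID NO: 1 translation.
REPEAT_REGION = (
    "YQELTQLKAAVGELPE"
    "KSKLQEIYQELTRLKA"
    "AVGELPEKSKLQEIYQ"
    "ELTWLKAAVGELPEKS"
    "KMQEIYQELTRLKAAV"
    "GELPEKSKQQEIYQEL"
    "TRLKAAVGELPEKSKQ"
    "QEIYQELTRLKAAVGE"
    "LPEKSKQQEIYQELTQ"
    "LKAAV"
)
REGION_START = 101    # residue number of the first character above
MOTIF = "LKAAVGEL"    # illustrative motif from the consensus repeat


def motif_positions(seq: str, motif: str, first_residue: int) -> list[int]:
    """Return the residue numbers at which each copy of motif begins."""
    positions = []
    idx = seq.find(motif)
    while idx != -1:
        positions.append(first_residue + idx)
        idx = seq.find(motif, idx + 1)
    return positions


positions = motif_positions(REPEAT_REGION, MOTIF, REGION_START)
spacings = [b - a for a, b in zip(positions, positions[1:])]
print("motif starts at residues:", positions)
print("spacing between copies:", spacings)
```

A constant spacing between successive copies is what a tandem repeat of fixed length would produce. The final partial repeat ends in LKAAVERL rather than LKAAVGEL, so it is not counted by this exact-match scan.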

The third domain (Cys 253 to Ala 404) is homologous to the other known C-type
lectins which are type II membrane proteins (FIGURE 3C). With the exception of the
IgEr, these lectins bind terminal D-galactose and N-acetylgalactosamine of glycoproteins
(Spiess, M. (1990) Biochemistry 29:10009-10018).

The most closely related sequences were the group of type II membrane protein C-type
lectins: chick hepatic lectin (CHL) (Drickamer, K. (1981) J. Biol. Chem.
256:5827-5839), low affinity IgE receptor (IgEr) (Kikutani, H. et al. (1986) Cell
47:657-665), the asialoglycoprotein receptors (human H1 and H2 (Spiess, M. et al.
(1985) Proc. Natl. Acad. Sci. USA 82:6465-6569) are shown), and the rat Kupffer cell
receptor (Hoyle, G.W. et al. (1988) J. Biol. Chem. 263:7487-7492). The most similar
mannose binding lectin was one of the eight carbohydrate recognition domains of the
human macrophage mannose receptor (Mannr) (Taylor, M.E. et al. (1990) J. Biol. Chem.
265:12156-12162; Ezekowitz, R.A.B. et al. (1990) J. Exp. Med. 172:1785-1794).
Residues identical to the gp120r are boxed. ALIGN scores indicate significant sequence
similarity if greater than 3.0. The complete gp120r sequence was most homologous to
the Kupffer cell receptor, which has a similar tandem repeat (Hoyle, G.W. et al.
(1988) J. Biol. Chem. 263:7487-7492).

The inability to crosslink gp120 to the non-CD4 sites on placenta and brain cell
lines (not shown) was consistent with an interaction of the gp120r with carbohydrate, and
polyclonal HIV antisera added to gp120 blocked binding to CD4 but not to the gp120r
(FIGURE 1B). Galactose and N-acetylgalactosamine did not block gp120 binding, but
mannose and fucose completely blocked binding to the gp120r without an effect on CD4
(FIGURE 1B). Inhibition by a series of sugars is shown in FIGURE 2B. Human IgE (10
μg/ml), sialic acid (100 mM), and mannose-6-phosphate (100 mM) had no effect on
binding to the gp120r. The three forms of gp120 used have different oligosaccharide
structures. Bgp120 contains only high mannose structures (Hsieh, P. et al. (1984) J. Biol.
Chem. 259:2375-2382). Vgp120 has equal proportions of high mannose and complex chains
(Mizuochi, T. et al. (1988) Biochem. J. 254:599-603), similar to ngp120, which has a
greater structural diversity in the complex chains (Geyer, H. et al. (1988) J. Biol. Chem.
263:11760-11767; Mizuochi, T. et al. (1990) J. Biol. Chem. 265:8519-8524). The
affinity of the gp120r for all three forms was similar (FIGURE 2A), suggesting that the
terminal mannose residues of high mannose chains are the primary determinants of binding. As
expected for a C-type lectin, the gp120r required calcium and binding was blocked by
EGTA (FIGURE 2B). The gp120r carbohydrate specificity is more closely related to
serum mannose-binding proteins and to the mannose-specific endocytosis receptor found
in macrophages ([...] 172:1785-1794) (FIGURE 3C). [...] from placental membranes
(FIGURE 2D) [...] with a reported concentration of 150-300 nM for gp120 saturation of
the macrophage receptor (Larkin, M. et al. (1989) [...] mannose-binding protein
(Ezekowitz, R.A.B. et al. [...] (Larkin, M. et al. (1989) AIDS [...] gp120 blocked
binding to [...] CD4 (FIGURE 1B), [...] of receptor interaction. The antibiotic
pradimicin A [...] and this inhibitory effect is prevented by [...] blocked gp120
binding to [...] binding to the gp120r [...] suggesting that in addition to CD4 the
gp120r may be important for HIV [...] the gp120r rapidly internalized its bound [...]
radiolabeled HIV in a gp120 dependent fashion (FIGURE 1C) also support this [...]

The foregoing description and Examples are intended as illustrative of the present
invention [...] variations and modifications may be effected
without departing from the true spirit and scope of the present invention.
SEQUENCE LISTING



(1) GENERAL INFORMATION: 

(i) APPLICANT: Curtis, Benson 

(ii) TITLE OF INVENTION: INHIBITION OF NON-CD4 MEDIATED HIV INFECTION

(iii) NUMBER OF SEQUENCES: 9 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Bristol-Myers Squibb Company 

(B) STREET: 3005 First Avenue 

(C) CITY: Seattle 

(D) STATE: Washington 

(E) COUNTRY: USA 

(F) ZIP: 98121 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: PatentIn Release #1.0, Version #1.25

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: US UNKNOWN 

(B) FILING DATE: 11-JUL-1991

(C) CLASSIFICATION: 

(viii) ATTORNEY/AGENT INFORMATION: 

(A) NAME: Sorrentino, Joseph M. 

(B) REGISTRATION NUMBER: 32,598 

(C) REFERENCE/DOCKET NUMBER: ON0086- 



(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: (206) 728-4800 

(B) TELEFAX: (206) 448-4775
(2) INFORMATION FOR SEQ ID NO: 1:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1312 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA
(vi) ORIGINAL SOURCE:

(A) ORGANISM: Human immunodeficiency virus type 1

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 42..1253

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1:

CTAAAGCAGG AGTTCTGGAC ACTGGGGGAG AGTGGGGTGA C ATG AGT GAC TCC      53
                                              Met Ser Asp Ser
                                              1

[...]

[...]                                                                293
Gln Glu Gln Ser Arg Gln Asp Ala Ile Tyr [...]
    70                  75

[...]                                                                341
Lys Ala Ala Val Gly Glu Leu Ser Glu Lys Ser Lys [...]
85                  90                  95

TAC CAG GAG CTG ACC CAG CTG AAG GCT GCA GTG GGT GAG CTT CCA GAG      389
Tyr Gln Glu Leu Thr Gln Leu Lys Ala Ala Val Gly Glu Leu Pro Glu
                105                 110                 115

AAA TCT AAG CTG CAG GAG ATC TAC CAG GAG CTG ACC CGG CTG AAG GCT      437
Lys Ser Lys Leu Gln Glu Ile Tyr Gln Glu Leu Thr Arg Leu Lys Ala
            120                 125                 130

GCA GTG GGT GAG CTT CCA GAG AAA TCT AAG CTG CAG GAG ATC TAC CAG      485
Ala Val Gly Glu Leu Pro Glu Lys Ser Lys Leu Gln Glu Ile Tyr Gln
        135                 140                 145

GAG CTG ACC TGG CTG AAG GCT GCA GTG GGT GAG CTT CCA GAG AAA TCT      533
Glu Leu Thr Trp Leu Lys Ala Ala Val Gly Glu Leu Pro Glu Lys Ser
    150                 155                 160

AAG ATG CAG GAG ATC TAC CAG GAG CTG ACT CGG CTG AAG GCT GCA GTG      581
Lys Met Gln Glu Ile Tyr Gln Glu Leu Thr Arg Leu Lys Ala Ala Val
165                 170                 175                 180

GGT GAG CTT CCA GAG AAA TCT AAG CAG CAG GAG ATC TAC CAG GAG CTG      629
Gly Glu Leu Pro Glu Lys Ser Lys Gln Gln Glu Ile Tyr Gln Glu Leu
                185                 190                 195

ACC CGG CTG AAG GCT GCA GTG GGT GAG CTT CCA GAG AAA TCT AAG CAG      677
Thr Arg Leu Lys Ala Ala Val Gly Glu Leu Pro Glu Lys Ser Lys Gln
            200                 205                 210

CAG GAG ATC TAC CAG GAG CTG ACC CGG CTG AAG GCT GCA GTG GGT GAG      725
Gln Glu Ile Tyr Gln Glu Leu Thr Arg Leu Lys Ala Ala Val Gly Glu
        215                 220                 225

CTT CCA GAG AAA TCT AAG CAG CAG GAG ATC TAC CAG GAG CTG ACC CAG      773
Leu Pro Glu Lys Ser Lys Gln Gln Glu Ile Tyr Gln Glu Leu Thr Gln
    230                 235                 240

CTG AAG GCT GCA GTG GAA CGC CTG TGC CAC CCC TGT CCC TGG GAA TGG      821
Leu Lys Ala Ala Val Glu Arg Leu Cys His Pro Cys Pro Trp Glu Trp
245                 250                 255                 260

ACA TTC TTC CAA GGA AAC TGT TAC TTC ATG TCT AAC TCC CAG CGG AAC      869
Thr Phe Phe Gln Gly Asn Cys Tyr Phe Met Ser Asn Ser Gln Arg Asn
                265                 270                 275

TGG CAC GAC TCC ATC ACC GCC TGC AAA GAA GTG GGG GCC CAG CTC GTC      917
Trp His Asp Ser Ile Thr Ala Cys Lys Glu Val Gly Ala Gln Leu Val
            280                 285                 290

GTA ATC AAA AGT GCT GAG GAG CAG AAC TTC CTA CAG CTG CAG TCT TCC      965
Val Ile Lys Ser Ala Glu Glu Gln Asn Phe Leu Gln Leu Gln Ser Ser
        295                 300                 305

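AGA AGT AAC CGC TTC ACC TGG ATG GGA CTT TCA GAT CTA AAT CAG GAA      1013
Arg Ser Asn Arg Phe Thr Trp Met Gly Leu Ser Asp Leu Asn Gln Glu
    310                 315                 320

GGC ACG TGG CAA TGG GTG GAC GGC TCA CCT CTG TTG CCC AGC TTC AAG      1061
Gly Thr Trp Gln Trp Val Asp Gly Ser Pro Leu Leu Pro Ser Phe Lys
325                 330                 335                 340

CAG TAT TGG AAC AGA GGA GAG CCC AAC AAC GTT GGG GAG GAA GAC TGC      1109
Gln Tyr Trp Asn Arg Gly Glu Pro Asn Asn Val Gly Glu Glu Asp Cys
                345                 350                 355

GCG GAA TTT AGT GGC AAT GGC TGG AAC GAC GAC AAA TGT AAT CTT GCC      1157
Ala Glu Phe Ser Gly Asn Gly Trp Asn Asp Asp Lys Cys Asn Leu Ala
            360                 365                 370

AAA TTC TGG ATC TGC AAA AAG TCC GCA GCC TCC TGC TCC AGG GAT GAA      1205
Lys Phe Trp Ile Cys Lys Lys Ser Ala Ala Ser Cys Ser Arg Asp Glu
        375                 380                 385

[...]                                                                1253
Glu Gln Phe Leu Ser Pro Ala Pro Ala Thr Pro Asn Pro Pro Pro Ala
    390                 395                 400

TAGCAGAACT TCACCCCCTT TTAAGCTACA GTTCCTTCTC TCCATCCTTC GACCTTTAG     1312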
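
As a consistency check on the listing above, the CDS feature (42..1253) should span a whole number of codons equal to the stated protein length, and the first codons should translate to the start of SEQ ID NO: 2. The following is a minimal Python sketch, assuming the standard genetic code. It uses only nucleotides 42 to 101 as printed in FIGURE 3A; the constant and function names are illustrative.

```python
from itertools import product

# Standard genetic code, built from the conventional TCAG ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, ignoring any trailing partial codon."""
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# CDS coordinates as given in the FEATURE block (1-based, inclusive).
CDS_START, CDS_END = 42, 1253
cds_length = CDS_END - CDS_START + 1
cds_codons = cds_length // 3

# Nucleotides 42-101 as printed in FIGURE 3A.
CDS_HEAD = "ATGAGTGACTCCAAGGAACCAAGACTGCAGCAGCTGGGCCTCCTGGAGGAGGAACAGCTG"

print("codons in CDS:", cds_codons)
print("first residues:", translate(CDS_HEAD))
```

The same `translate` helper can be applied to any other stretch of the listing to compare the codon lines with the amino-acid lines beneath them.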

(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 404 amino acids

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:2: 

Met Ser Asp Ser Lys Glu Pro Arg Leu Gln Gln Leu Gly Leu Leu Glu
1               5                   10                  15

Glu Glu Gln Leu Arg Gly Leu Gly Phe Arg Gln Thr Arg Gly Tyr Lys
            20                  25                  30

Ser Leu Ala Gly Cys Leu Gly His Gly Pro Leu Val Leu Gln Leu Leu
        35                  40                  45

Ser Phe Thr Leu Leu Ala Gly Leu Leu Val Gln Val Ser Lys Val Pro
    50                  55                  60

Ser Ser Ile Ser Gln Glu Gln Ser Arg Gln Asp Ala Ile Tyr [...] [...]
65                  70                  75

Leu Thr Gln Leu Lys Ala Ala Val Gly Glu Leu Ser Glu Lys Ser Lys
                85                  90                  95

Leu Gln Glu Ile Tyr Gln Glu Leu Thr Gln Leu Lys Ala Ala Val Gly
            100                 105                 110

Glu Leu Pro Glu Lys Ser Lys Leu Gln Glu Ile Tyr Gln Glu Leu Thr
        115                 120                 125

Arg Leu Lys Ala Ala Val Gly Glu Leu Pro Glu Lys Ser Lys Leu Gln
    130                 135                 140

Glu Ile Tyr Gln Glu Leu Thr Trp Leu Lys Ala Ala Val Gly Glu Leu
145                 150                 155                 160

Pro Glu Lys Ser Lys Met Gln Glu Ile Tyr Gln Glu Leu Thr Arg Leu
                165                 170                 175

Lys Ala Ala Val Gly Glu Leu Pro Glu Lys Ser Lys Gln Gln Glu Ile
            180                 185                 190

Tyr Gln Glu Leu Thr Arg Leu Lys Ala Ala Val Gly Glu Leu Pro Glu
        195                 200                 205

Lys Ser Lys Gln Gln Glu Ile Tyr Gln Glu Leu Thr Arg Leu Lys Ala
    210                 215                 220

Ala Val Gly Glu Leu Pro Glu Lys Ser Lys Gln Gln Glu Ile Tyr Gln
225                 230                 235                 240

Glu Leu Thr Gln Leu Lys Ala Ala Val Glu Arg Leu Cys His Pro Cys
                245                 250                 255

Pro Trp Glu Trp Thr Phe Phe Gln Gly Asn Cys Tyr Phe Met Ser Asn
            260                 265                 270

Ser Gln Arg Asn Trp His Asp Ser Ile Thr Ala Cys Lys Glu Val Gly
        275                 280                 285

Ala Gln Leu Val Val Ile Lys Ser Ala Glu Glu Gln Asn Phe Leu Gln
    290                 295                 300

Leu Gln Ser Ser Arg Ser Asn Arg Phe Thr Trp Met Gly Leu Ser Asp
305                 310                 315                 320

Leu Asn Gln Glu Gly Thr Trp Gln Trp Val Asp Gly Ser Pro Leu Leu
                325                 330                 335

Pro Ser Phe Lys Gln Tyr Trp Asn Arg Gly Glu Pro Asn Asn Val Gly
            340                 345                 350

Glu Glu Asp Cys Ala Glu Phe Ser Gly Asn Gly Trp Asn Asp Asp Lys
        355                 360                 365

Cys Asn Leu Ala Lys Phe Trp Ile Cys Lys Lys Ser Ala Ala Ser Cys
    370                 375                 380

Ser Arg Asp Glu Glu Gln Phe Leu Ser Pro Ala Pro Ala Thr Pro Asn
385                 390                 395                 400

Pro Pro Pro Ala

(2) INFORMATION FOR SEQ ID NO: 3:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 127 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein
(v) FRAGMENT TYPE: internal

(vi) ORIGINAL SOURCE:
(A) ORGANISM: Human immunodeficiency virus type 1

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3:

Cys His Pro Cys Pro Trp Glu Trp Thr Phe Phe Gln Gly Asn Cys Tyr
1               5                   10                  15

Phe Met Ser Asn Ser Gln Arg Asn Trp His Asp Ser Ile Thr Ala Cys
            20                  25                  30

Lys Glu Val Gly Ala Gln Leu Val Val Ile Lys Ser Ala Glu Glu Gln
        35                  40                  45

Asn Phe Leu Gln Leu Gln Ser Ser Arg Ser Asn Arg Phe Thr Trp Met
    50                  55                  60

Gly Leu Ser Asp Leu Asn Gln Glu Gly Thr Trp Gln Trp Val Asp Gly
65                  70                  75                  80

Ser Pro Leu Leu Pro Ser Phe Lys Gln Tyr Trp Asn Arg Gly Glu Pro
                85                  90                  95

Asn Asn Val Gly Glu Glu Asp Cys Ala Glu Phe Ser Gly Asn Gly Trp
            100                 105                 110

Asn Asp Asp Lys Cys Asn Leu Ala Lys Phe Trp Ile Cys Lys Lys
        115                 120                 125



(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 126 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

Cys Gly Ala Gln Ser Arg Gln Trp Glu Tyr Phe Glu Gly Arg Cys Tyr
1               5                   10                  15

[...] Phe Ser Leu Ser Arg Met Ser Trp His Lys Ala Lys Ala Glu Cys
            20                  25                  30

Glu Glu Met His Ser His Leu Ile Ile Ile Asp Ser Tyr Ala Lys Gln
        35                  40                  45

Asn Phe Val Met Phe Arg Thr Arg Asn Glu Arg Phe Trp Ile Gly Leu
    50                  55                  60

Thr Asp Glu Asn Gln Glu Gly Glu Trp Gln Trp Val Asp Gly Thr Asp
65                  70                  75                  80

Thr Arg Ser Ser Phe Thr Phe Trp Lys Glu Gly Glu Pro Asn Asn Arg
                85                  90                  95

Gly Phe Asn Glu Asp Cys Ala His Val Trp Thr Ser Gly Gln Trp Asn
            100                 105                 110

Asp Val Tyr Cys Thr Tyr Glu Cys Tyr Tyr Val Cys Glu Lys
        115                 120                 125

(2) INFORMATION FOR SEQ ID NO: 5:
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 125 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein
(v) FRAGMENT TYPE: internal

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5:

Cys Asn Thr Cys Pro Glu Lys Trp Ile Asn Phe Gln Arg Lys Cys Tyr
1               5                   10                  15

Tyr Phe Gly Lys Gly Thr Lys Gln Trp Val His Ala Arg Tyr Ala Cys
            20                  25                  30

Asp Asp Met Glu Gly Gln Leu Val Ser Ile His Ser Pro Glu Glu Gln
        35                  40                  45

Asp Phe Leu Thr Lys His Ala Ser His Thr Gly Ser Trp Ile Gly Leu
    50                  55                  60

Arg Asn Leu Asp Leu Lys Gly Glu Phe Ile Trp Val Asp Gly Ser His
65                  70                  75                  80

Val Asp Tyr Ser Asn Trp Ala Pro Gly Glu Pro Thr Ser Arg Ser Gln
                85                  90                  95

Gly Glu Asp Cys Val Met Met Arg Gly Ser Gly Arg Trp Asn Asp Ala
            100                 105                 110

Phe Cys Asp Arg Lys Leu Gly Ala Trp Val Cys Asp Arg
        115                 120                 125



(2) INFORMATION FOR SEQ ID NO: 6:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 129 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein
(v) FRAGMENT TYPE: internal

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6:

Arg Thr Cys Cys Pro Val Asn Trp Val Glu His Glu Arg Ser Cys Tyr
1               5                   10                  15

Trp Phe Ser Arg Ser Gly Lys Ala Trp Ala Asp Ala Asp Asn Tyr Cys
            20                  25                  30

Arg Leu Glu Asp Ala His Leu Val Val Val Thr Ser Trp Glu Glu Gln
        35                  40                  45

Lys Phe Val Gln His His Ile Gly Pro Val Asn Thr Trp Met Gly Leu
    50                  55                  60

His Asp Gln Asn Gly Pro Trp Lys Trp Val Asp Gly Thr Asp Tyr Glu
65                  70                  75                  80

Thr Gly Phe Lys Asn Trp Arg Pro Glu Gln Pro Asp Asp Trp Tyr Gly
                85                  90                  95

His Gly Leu Gly Gly Gly Glu Asp Cys Ala His Phe Thr Asp Asp Gly
            100                 105                 110

Arg Trp Asn Asp Asp Val Cys Gln Arg Pro Tyr Arg Trp Val Cys Glu
        115                 120                 125

Thr



(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 129 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7:

Arg Thr Cys Cys Pro Val Asn Trp Val Glu His Gln Gly Ser Cys Tyr
1               5                   10                  15

Trp Phe Ser His Ser Gly Lys Ala Trp Ala Glu Ala Glu Lys Tyr Cys
            20                  25                  30

Gln Leu Glu Asn Ala His Leu Val Val Ile Asn Ser Trp Glu Glu Gln
        35                  40                  45

Lys Phe Ile Val Gln His Thr Asn Pro Phe Asn Thr Trp Ile Gly Leu
    50                  55                  60

Thr Asp Ser Asp Gly Ser Trp Lys Trp Val Asp Gly Thr Asp Tyr Arg
65                  70                  75                  80

His Asn Tyr Lys Asn Trp Ala Val Thr Gln Pro Asp Asn Trp His Gly
                85                  90                  95

His Glu Leu Gly Gly Ser Glu Asp Cys Val Glu Val Gln Pro Asp Gly
            100                 105                 110

[...] Trp Asn Asp Asp Phe Cys Leu Gln Val Tyr Arg Trp Val Cys Glu
        115                 120                 125

Lys



(2) INFORMATION FOR SEQ ID NO: 8:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 130 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8:



Leu Gin Leu He Met Gin Asp Trp Lys Tyr Phe Asn Gly Lys Phe Tyr 
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^ Ph. S.r Ar, Asp W . ~ ttp Hi. Glu M. 01» ™ <*» 

20 „ 

. y ,i a eor Val Thr Ser Gin Glu Glu Gin 
Val Ser Gin Gly Ala His Leu Ala Ser Val Tnr » ^ 

35 40 
Ala Ph. L. V.1 tt U. Thr ».» XI. V.! «P Hi. Trp He Cly — 

50 55 
* Asp cl» 01, Thr Olu 01, - Trp *, Trp v.1 »P 01, Thr »» 

70 

1 «P T,r v.1 olh »r AT, Ph. Trp « g Lye 01, 01. «o »P 

85 9U 

A.„ Trp «, Hi. Ox - 01, 01* £ «. U, V £ - «» 

100 105 

^ Met Trp Asn Asp Met Ala Cye Gly Thr Ala Tyr Asn Trp Val Cys 



115 120 



Lys Lys 
130 



(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 130 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 

Pro Thr His cys Pro Ser Gin Trp Trp Pro Tyr Ala Gly His Cys Tyr 
1 5 10 

Lys lie His Arg Asp Glu Lys Lys lie Gin Arg Asp Ala Leu Thr Thr 

20 25 
cys Arg Lys Glu Gly Gly Asp Leu Thr Ser He His Thr He Glu Glu 



35. 40 
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« »P ». n. n. ser S » » «T - «- « - - 

• 50 55 

„ n. o ly M u « MP n. «r- XI. om m* Tyr M «. Trp ser 

65 70 

Mp oly fc Pro v.. « P.- ~ «. TJP - «T « « ~ 

85 

. n« siu Asp cys Val val Met Lys Gly Lys Asp 
His Glu Asn Asn Arg Gin Glu Aap cys ^ 

100 los 

, civ Cvs Glu Trp Pro Leu Gly Tyr He Cys 

Gly Tyr Trp Ala Asp Arg Gly cys gj.u xr F 125 



115 



Lys Met 
130 
We claim:

1. A method of inhibiting HIV infection of mammalian cells comprising contacting the
cells with an effective amount of a compound selected from the group consisting of
a mannose carbohydrate, a fucose carbohydrate, a lectin and a drug, for a time
period sufficient to significantly inhibit the binding of HIV to a non-CD4 cell
surface protein.

2. The method of Claim 1, wherein the non-CD4 cell surface protein is a gp120
receptor having a specific binding affinity for gp120 of about Kd = 1.3 nM to
about Kd = 2.0 nM.

3. The method of Claim 2, wherein the gp120 receptor is present on placental cells.

4. The method of Claim 2, wherein the gp120 receptor is present on muscle cells.

5. The method of Claim 2, wherein the gp120 receptor is present on neural cells.

6. The method of Claim 5, wherein the neural cells are brain cells.

7. The method of Claim 5, wherein the neural cells are dendritic cells.

8. The method of Claim 2, wherein the gp120 receptor is present on mucosal cells.

9. The method of Claim 1, wherein the compound is mannose.

10. The method of Claim 1, wherein the compound is fucose.

11. The method of Claim 1, wherein the compound is a mannose-containing
carbohydrate.

12. The method of Claim 11, wherein the carbohydrate is mannan.

13. The method of Claim 1, wherein the compound is a pradimicin A antibiotic.

14. A substantially purified non-CD4 gp120 receptor protein comprising a protein
substantially corresponding to a non-CD4 mammalian cell surface protein that has a
specific binding affinity for gp120, said protein containing about 400 amino acid
residues, having a molecular weight of about 45,000 daltons and having a binding
affinity for gp120 characterized by a Kd of about 1.3 nM to about 2 nM.

15. The gp120 receptor protein of Claim 14, wherein the binding of the gp120 receptor
protein to gp120 is inhibited by a compound selected from the group consisting of a
mannose carbohydrate, a fucose carbohydrate, a lectin and a drug.

16. The gp120 receptor of Claim 15, wherein the compound is mannose.

17. The gp120 receptor protein of Claim 15, wherein the compound is a pradimicin A
antibiotic.

18. The gp120 receptor protein of Claim 14, wherein the protein is produced by
recombinant means.

19. The gp120 receptor protein of Claim 18, wherein said recombinant means
comprises the cloning of a cDNA isolated from a library of recombinant placental
genes.

20. A DNA molecule encoding the gp120 receptor protein of Claim 14, wherein the
DNA is a complementary DNA that transcribes an mRNA found in cells selected
from the group consisting of placental cells, brain cells, muscle cells and colon
cells.

21. A method of detecting the presence of HIV in a sample comprising:

(a) admixing in an aqueous medium a sample to be assayed with a non-CD4
gp120 receptor protein having a specific binding affinity for gp120
characterized by a Kd of about 1.3 nM to about 2.0 nM in an amount
sufficient to carry out at least one assay;

(b) maintaining the admixture for a time period sufficient for the gp120
receptor protein to bind to any HIV present in the sample and form a
reaction product; and

(c) determining the presence of the HIV containing reaction product.
22. The method of Claim 21, wherein the gp120 receptor protein [...]
daltons.

23. The method of Claim 21, wherein the gp120 receptor protein is affixed to a
solid matrix to form a solid support.

24. The method of Claim 21, wherein the presence of the reaction product is
[...] bound gp120 receptor protein.

25. The method of Claim 24, wherein the reagent is a [...]
against the gp120 receptor protein.

26. A diagnostic system in kit form, for assaying for [...]
fluid sample, comprising a package containing a non-CD4 receptor protein
[...] affinity for gp120 characterized by [...]
to about 2.0 nM, and instructions for use.

27. The diagnostic system of Claim 26, wherein the non-CD4 gp120 receptor
protein is affixed to a solid matrix to form a solid support.
Figure 1A
Figure 1B
Figure 1C
Figure 2A (axis label: Bound (nM))
Figure 2B (axis label: Log [Competitor] (M))
Figure 2C (axis label: Time (minutes))
Figure 2D
Figure 2E
   1 CTAAAGCAGGAGTTCTGGACACTGGGGGAGAGTGGGGTGAC

  42 ATGAGTGACTCCAAGGAACCAAGACTGCAGCAGCTGGGCCTCCTGGAGGAGGAACAGCTG
   1 MSDSKEPRLQQLGLLEEEQL

 102 AGAGGCCTTGGATTCCGAC[...]
  21 RGLGFRQTRGYKSLAGCLGH

 162 GGTCCCCTGGTGCTGCAACTC[...]
  41 GPLVLQLLSFTLLAGLLVQV

 222 [...]
  61 SKVPSSISQEQSRQDAIY[...]

R1

 282 [...]
  81 LTQLKAAVGELSEKSKLQEI

R2

 342 TACCAGGAGCTGACCCAGCTGAAGGCTGCAGTGGGTGAGCTTCCAGAGAAATCTAAGCTG
 101 YQELTQLKAAVGELPEKSKL

 402 CAGGAGATCTACCAGGAGCTGACCCGGCTGAAGGCTGCAGTGGGTGAGCTTCCAGAGAAA
 121 QEIYQELTRLKAAVGELPEK

R3

 462 TCTAAGCTGCAGGAGATCTACCAGGAGCTGACCTGGCTGAAGGCTGCAGTGGGTGAGCTT
 141 SKLQEIYQELTWLKAAVGEL

R4

 522 CCAGAGAAATCTAAGATGCAGGAGATCTACCAGGAGCTGACTCGGCTGAAGGCTGCAGTG
 161 PEKSKMQEIYQELTRLKAAV

R5

 582 GGTGAGCTTCCAGAGAAATCTAAGCAGCAGGAGATCTACCAGGAGCTGACCCGGCTGAAG
 181 GELPEKSKQQEIYQELTRLK

R6

 642 GCTGCAGTGGGTGAGCTTCCAGAGAAATCTAAGCAGCAGGAGATCTACCAGGAGCTGACC
 201 AAVGELPEKSKQQEIYQELT

R7

 702 CGGCTGAAGGCTGCAGTGGGTGAGCTTCCAGAGAAATCTAAGCAGCAGGAGATCTACCAG
 221 RLKAAVGELPEKSKQQEIYQ

R8

 762 GAGCTGACCCAGCTGAAGGCTGCAGTGGAACGCCTGTGCCACCCCTGTCCCTGGGAATGG
 241 ELTQLKAAVERLCHPCPWEW

 822 ACATTCTTCCAAGGAAACTGTTACTTCATGTCTAACTCCCAGCGGAACTGGCACGACTCC
 261 TFFQGNCYFMSNSQRNWHDS

Figure 3A
 882 ATCACCGCCTGCAAAGAAGTGGGGGCCCAGCTCGTCGTAATCAAAAGTGCTGAGGAGCAG
 281 ITACKEVGAQLVVIKSAEEQ

 942 AACTTCCTACAGCTGCAGTCTTCCAGAAGTAACCGCTTCACCTGGATGGGACTTTCAGAT
 301 NFLQLQSSRSNRFTWMGLSD

1002 CTAAATCAGGAAGGCACGTGGCAATGGGTGGACGGCTCACCTCTGTTGCCCAGCTTCAAG
 321 LNQEGTWQWVDGSPLLPSFK

1062 CAGTATTGGAACAGAGGAGAGCCCAACAACGTTGGGGAGGAAGACTGCGCGGAATTTAGT
 341 QYWNRGEPNNVGEEDCAEFS

1122 GGCAATGGCTGGAACGACGACAAATGTAATCTTGCCAAATTCTGGATCTGCAAAAAGTCC
 361 GNGWNDDKCNLAKFWICKKS

1182 GCAGCCTCCTGCTCCAGGGATGAAGAACAGTTTCTTTCTCCAGCCCCTGCCACCCCAAAC
 381 AASCSRDEEQFLSPAPATPN

1242 CCCCCTGCTGCGTAGCAGAACTTCACCCCCTTTTAAGCTACAGTTCCTTCTCTCCATCCT
 401 PPPA***

1302 TCGACCTTTAG

Figure 3A (cont.)
Figure 3B
Figure 3C